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Abstract. Consider a random matrix A £ Qrnxn ( m > n ) containing independent complex 
Gaussian entries with zero mean and unit variance, and let < Ai < A2 < . . . < X n < 00 denote 
the eigenvalues of A*A where (■)* represents conjugate-transpose. This paper investigates the dis- 
tribution of the random variables — L for fc = 1 and k = 2. These two variables are related 

T— 1 A fc 

. to certain condition number metrics, including the so-called Demmel condition number, which have 

■ been shown to arise in a variety of applications. For both cases, we derive new exact expressions 

for the probability densities, and establish the asymptotic behavior as the matrix dimensions grow 
large. In particular, it is shown that as n and m tend to infinity with their difference fixed, both 
densities scale on the order of n 3 . After suitable transformations, we establish exact expressions 
for the asymptotic densities, obtaining simple closed-form expressions in some cases. Our results 
generalize the work of Edelman on the Demmel condition number for the case m = n. 

<N 

Key words. Demmel condition number, Eigenvalues, Random matrix, Wishart distribution 
I— 1_ 

AMS subject classifications. 15B52, 65F35, 15A18, 62H10 

CO; 

1- Introduction. Understanding the sensitivity of outcomes to initial condi- 
tions in certain iterative algorithms is important for the design and analysis of many 
physical systems. Example algorithms include iterative methods used in linear alge- 
bra, interior-point methods of convex optimization, and polynomial zero finding [5J. 
Various condition numbers have been defined as a measure of the sensitivity of the 
solutions with respect to small perturbations of the input. One of the earliest studies 

by Turin |45| on iterative algorithms related to matrix inversion and the solution of 

large systems of linear equations defined two such condition numbers as a measure of 
the degree of ill-conditioning in a matrix. In particular, for a matrix A € R™ xn , the 
first was referred to as the TV-condition number, defined as N (A) N (A~ x ) / n with N(-) 
representing the Frobenius norm, whilst the second was referred to as the M -condition 
number, defined as M(A)M (A _1 )/n with M(-) denoting the operator returning the 
largest absolute entry of the matrix. However, probably the best known condition 
number, introduced in [3B], takes the form re(A) = 1 1 A| 1 2 1 1 A" 1 1 1 2 , where A g c nx ™ 
and || • H2 denotes the 2-norm. This condition number, similar to the ./V-condition 
number and M -condition number, is important in problems involving matrix inversion 
and solutions of linear equations. 

As conjectured in [38], the calculation of a condition number pertaining to a 
problem with a certain deterministic data set is as hard as solving the problem itself 
with the data set. Whilst very difficult to prove in general, a proof of this conjecture 
has been established for conic condition numbers in |12j . To gain further insights 
into the behavior of the condition number, probabilistic analysis based on certain 
probability measures on the data has been employed; see [51 H^l[lH[2l71r^2l[3^1HfJllT7] 
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for a partial list. In general, most existing literature dealing with the probabilistic 
analysis of condition numbers has focused on one of two main perspectives. The first is 
to establish bounds on the tail of the distribution of k(A), which is important for the 
geometrical characterization of the condition number 17 . Central to the geometric 
characterization is the fact that k(A) is proportional to the reciprocal of the distance 
to the set of ill-conditioned matrices [17] . The second has been to establish bounds 
on the expectation of ln(/i(A)), which is important for characterizing the average 
loss of numerical precision and average running time [51 [22] of iterative algorithms. 
In general, the "randomness" of the condition number is introduced by selecting the 
elements of A to be standard independent normal real/complex random variables 

[23 [221 ID]. 

Whilst in this paper we focus primarily on a probabilistic analysis, it is worth 
noting that a deterministic version of the problems considered above, related to the 
inversion of positive-definite Hankel (or moment) matrices appearing naturally in 
random matrix theory, involves the determination of the smallest eigenvalues of Hankel 
matrices of order n. See [5] and [U [pj H3[ [50] for contributions dealing with this 
problem. 

Demmel, in his seminal paper |17| . introduced a new condition number of conic 
type, defined for square n x n matrix A as 

KD (A) = ||A|| F ||A- 1 || 2 , (1.1) 

where || • \\f is the Frobenius norm. This, along with a theorem due to Eckart and 
Young [15], permits a geometrical characterization [51 [T7], which has enabled the 
calculation of bounds on the tail distribution of kd(A) with the help of integral 
geometry. A random variable similar to k_d(A) also arises in problems dealing with 
the entanglement of a bi-partite quantum system [21 [8]. As demonstrated in [161 
117] . the condition numbers arising in various contexts, including matrix inversion, 
eigenvalue calculation, polynomial zero finding, as well as pole assignment in linear 
control systems, can be bounded by «d(A). The definition (jl.l[) extends naturally to 
rectangular matrices by replacing A -1 in (jl.lj) with A^ (also known as the Moore- 
Penrose inverse or the pseudo-inverse) to yield 

« D (A) = ||A|| J r||At|| a . (1.2) 

For A G C mx " with rank(A) = r (< min(m, n)), (|1.2I) simplifies to 



MA) = y%^, (i.3) 

where Ai < A 2 < . . . < A r are the non-zero eigenvalues of A* A (or AA*) with 
(■)* denoting conjugate-transpose. In this paper, we deal exclusively with Gaussian 
random matrices A £ c mx ™ (to > n) having independent complex standard normal 
entriefl Therefore, (|1.3p can be written as 



KD (A) = j5=^, (1.4) 



1 Henceforth, when referring to "Gaussian matrices", we will implicitly assume that the matrices 
have independent standard complex normal entries; i.e., this will not be explicitly stated. 
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which follows from the fact that the matrix A* A is positive-definite with probability 
one [T51ED] ■ This form of the Demmel condition number was used by Krishnaiah et. al. 
in a sequence of papers [27J HH US] to derive the inferences on certain sub-hypotheses 
when the total hypothesis, which is composed of various component sub-hypotheses, 
is rejected. Recently, it was shown in [6] that the condition number of the form 21) 
is useful in analyzing the loss of precision in the computation of the solution to the 
classical linear least squares problem. Also, the statistical properties of the Dem- 
mel condition number K£>(A) (for arbitrary m and n) have recently found important 
applications in the design and analysis of contemporary wireless communication sys- 
tems. Specific examples include the design of adaptive multi-antenna transmission 
techniques [24] . and the modeling of physical multi-antenna transmission channels 

m 

From the discussion given above, there is clear motivation for studying the statis- 
tical properties of the Demmel condition number of random matrices. Here we review 
some of the key existing contributions dealing with this problem. These contributions 
include (6[ [TjJ (20[ [2TJ [22] , which investigated the exact distributions as well as bounds 
on the tail probabilities over various regimes depending on the size of the random 
matrix. For instance, Demmel showed that for Gaussian matrices A £ C nx ™ |17) . 

(1 - 1/x) 2 " 2 - 2 eVHl + rvYz) 2 " 2 - 2 
2^2 < Pr(MA) > *)< — 2 , 

while Edelman concluded that as n — ► oo |21j , 

_ {2k d (A) \ {A 
Pr \ — - < x -> exp 



n 2 



For Gaussian matrices A G ^mxn w ^jj m > n> on e could also exploit the bound 
kd(A) < y / n«(A) (following from the fact that ||A||jr < y^n||A||2) along with the 
upper bound on the distribution tail of k(A) given in |1 1 1 Theorem 4.6] to derive an 
upper bound on the distribution tail of kd(A). This, in turn, reveals that the r?l 2 
asymptotic scaling order observed in [21] for square Gaussian matrices also serves as an 
upper bound on the scaling order for rectangular Gaussian matrices. The exact scaling 
order, however, has yet to be determined. In addition to these results, more recently, 
various exact closed-form expressions for the distribution of the Demmel condition 
number for Gaussian matrices A G c mx " have been reported in J3TJ [52j [49] . These 
results, however, are rather complicated and they become unwieldy computationally 
when the matrix dimensions are not small. Moreover, the expressions in [3TJ[52J[3H] are 
not suitable for understanding the behavior of the scaled Demmel condition number 
as the matrix dimensions grow large. In this respect, only for the case m = n given in 
[3TJ [52J [49] , the asymptotic behavior of the exact probability density function (p.d.f.) 
of the Demmel condition number is known. Establishing the asymptotic properties 
for more general m x n (m < n) matrices is one of the key objectives to be addressed 
in this paper. 



In addition to «d(A), other related metrics of the form Afc/ X)f=i = 1, 2, 



, n 



have been considered by Krishnaiah et. al. 27, 28, 29] in certain hypothesis testing 
problems. Therein, they employed a result due to Davis [T5] . which gives an expression 
for the p.d.f. of ^k/Y^j=i by establishing a Laplace transform relationship with 
respect to the p.d.f. of A*,. However, the expressions obtained are very complicated, 
both analytically and numerically. This complexity, in turn, does not easily facilitate 
the understanding of the asymptotic behavior of the p.d.f. as the matrix dimensions 



4 



P. DHARMAWANSA, M. R. MCKAY and Y. CHEN 



become large. In this paper, we tackle this problem by adopting a different approach, 
making use of techniques developed in |32j . We focus our analysis on the quantities, 
K%(A) and 

4(A) = 



A 2 ' 

whilst noting that our approach may also pave the way for studying more generalized 

YJ 71 A 

metrics of the form — J ^ J , k = 1,2, ... ,n. We derive new expressions for the exact 
and asymptotic distributions of n 2 D (A) and n 2 B {A) for Gaussian matrices A <E c mx ™ 
(m > n) by adopting a moment generating function (m.g.f.) based approach. We 
show the interesting result that both n 2 D (A) and K%(A) scale on the order of n 3 when 
to and 77 tend to infinity in such a way that 777 — 77 remains a fixed integer. These 
results agree with and generalize the scaling behavior obtained previously by Edelman 
in [21] for 77 x 77 Gaussian matrices. The scaled asymptotic p.d.f. which we derive for 
n 2 D (A) is expressed in closed form for arbitrary 777 > 77, whilst for n 2 E (A) it involves 
a single finite-range integral for the general case and a closed-form solution for the 
scenario 777 = 77. 

2. Preliminaries. To facilitate our main derivations, we will require the follow- 
ing preliminary results and definitions. 

Definition 2.1. Let the elements of A £ C rnxn (m > n) be independent and 
identically distributed complex standard normal variables. Then the matrix W = A* A 
is said to follow a complex Wishart distribution, i.e., W ~ }Vn(m,l n ). 

Theorem 2.2. The joint density of the ordered eigenvalues < Ai < A2 < . . . < 
A n < 00 of W is given by \25f 

n 

f(X 1 ,X 2 ,..., A„) = K n>a A 2 n (X) 11 \?e- x i (2.1) 

where, for X = {Ai, A 2 , . . . , A„}, A„(A) := Yli<j <k < n (^k - ^j), a = m — n, and 

K n ,a=nl (n;=o(j + l)!(j + «)!) _1 - 

Lemma 2.3. For p > —1, the generalized Laguerre polynomial of degree N, 
L^-\z), is defined by \44V : 



N 



L N (z) - -^— 1 F 1 (-N,p+l,z) - ^ J-^jy.J , (2-2) 



3=0 



with k th derivative 



^L%\z) = (-l)*L<£$(z), (2.3) 

where (a)j — a(a + 1) . . . (a + j — 1) with (a)o = 1 is the Pochhammer symbol and 
iFi(a; c; z) is the confluent hypergeometric function of the first kind. 

Lemma 2.4. The monomial z n can be expanded in terms of the Stirling number 
of the second kind, Sn , as follows {I]/: 

n 

z n = J2 z(z - l)(z - 2) • • • (z - j + 1), (2.4) 
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where 



rn 



j=0 



7 (m\ m! 

ana . = ... .,, . 

Finally, we use the following notation to compactly represent the determinant of 
an N x N block matrix: 



det [aij 6j,fe-2]i=i,2,...,JV 



i=i,2 

fc 3,4 V 



0-1,1 0-1,2 &1,2 ■ • • &l,JV-2 

12,1 «2,2 &2,1 ^2.2 ■ • ■ &2,N-2 



0JV,1 a iV,2 &AT,1 ^Af,2 ■ • • &iV,iV-2 



(2.5) 



3. M.g.f. and p.d.f. of re|,(A). In this section we derive new expressions for 
the m.g.f. and the p.d.f. of n 2 D (A). Our approach follows along similar lines to [32l 
Chap. 22]. 

The m.g.f. of k 2 d {A.) is given by 



M k 2 d{a) {s) = E 



e A i /(Ai,A2,...,A„)dA 2 ...dA„dAi, 



'0 JRi 

where 7?.i = {Ai < A2 < . . . < A„ < 00}. Let Ai = x. An easy manipulation gives 

„ S"=2 x j 



■M k 2 d(a) (s) = e 



JTZt 



f (x, A2, . • . , A n ) dA 2 . . . dX n dx. (3.1) 



The theorem below gives a closed-form representation for the m.g.f. 
Theorem 3.1. The m.g.f. of n 2 D (A) is given by 



I p—ns f°° rj.ran—1 ^—nx 



(m- 1)1 J {s + xy 



det 



7-C+1) 1 , «>| 



fe./=l,2.. 



dx. 

(3.2) 



Proof. Substituting (|2.ip into (|3.ip we find, keeping the x integration last, 

POO j p n 

M K 2 W (s) = K n>a e~ s / aTe-* { / [[(X, - xf\^- x ^) 

JO [ JKl j=2 



x Yl ( A j _ A ') 2 dA 2 . . • dA n f dx. 

2<i<j<n 



Relabeling variables as Xj = Xj-\, j = 2, 3, . . . , n, and exploiting symmetry to remove 
the ordered region of integration (i.e., achieved by dividing through by (n — 1)!) gives 



M k 2 d(a) (s) 



"F 3 !)! Jo 



Y[( Xj -x) 2 x*e- x ^ 1+ ^) 

[x.,00)"-- 1 j = 1 

x A 2 l _ 1 (x) dx\dx2 ■ ■ .dx n -i > dx. (3.3) 
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Now we apply the change of variables x/j — ( x+s > {xj — x) , j = 1, 2, . . . , n — 1 to the 
inner (n — l)-fold integral in ()3.3|) with some algebraic manipulation to obtain 

— nx 

__ ( _ 1)Q („-l )g(n _ ^ ft) _ x _ g) 



(n — lj! 7 ( x + s ) 

where we have defined 
Q(n,a,z) := 



n It, 

/ ( y ) [] y 2 e -« (z - % .) Q dyidy 2 . . . dy n 



(3.4) 



(3.5) 



The remaining task is to obtain a closed- form solution to Q(n,a, z). We point 
out that a solution to a generalization of this integral, not restricting a to an integer, 
has been derived in [TU1 [35] as a solution to a Painleve V equation. Here, for a an 
integer, we establish a much simpler closed-form algebraic solution. For this purpose, 
we employ a result from random matrix theory [32[ Section 22.2.2], which gives 



r Ai 



Q(n, a,z) — b Q(n, 0, z) det 



^jC'n+fc^) 



fc,J=0,l,...,a-l 



where b = 



For our problem Cj (x) are monic polynomials orthogo- 



nal with respect to the weight x 2 e x , over < x < oo. We see that Cj(x) 
(— 1) J j! (x). Hence (|3.5p becomes 



Q(n, a, z) = b Q{n, 0, z) det 



(2) 



dz r 



a-1 



6Q(7i,0,z)(-l)" tt ]J(n + i)! dct 

3=0 



(2+i) 
n+fc— / 



fc,Z=0,l,...,a-l 



fc,/=0,l....,Q-l 



(3.6) 



Moreover, we have Q(n, 0, z) = Ilj=o + j)K^ + which when used with (I3.6|) in 
(13. 4p . followed by translating the indices from k, I = 0, 1, . . . , a — 1 to k, I = 1, 2, . . . , a 
gives ([33]) . □ 

Now we take the inverse Laplace transform of (|3.2|) to arrive at the p.d.f. of 
k|j(A), which is given below. 

Corollary 3.2. TTie p.d.f. of k 2 d (A) is given by 



r(mn) 



(n + a-1)! y ri 



dct 



fej=l,2,. 



(3.7) 



where C (■) denotes the inverse Laplace transform. 

Interestingly, by noting that the p.d.f. of the minimum eigenvalue Ai takes the 
form 



/min(^) 



Kn 



(n-1)! 



f(x,\ 2 ,...,\ n )d\2...d\„ 



Kn ' a x a e~ nx {-l) a{n ~^Q{n - 1, a, -x) 



(n-1)! 
n! 

(n + a- 1)!" 



+ (__ T V 



fc,i=l,2. 



(3.8) 
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we can obtain the following alternative representation for the p.d.f. of k d (A.) 

{a} = r(mn) , [ /min(s) 

Jk 2 d (A)^"> ymn 1 s mn-l 

This turns out to be a simpler representation of an equivalent relation given previously 
in [29j (obtaining one relation from the other, however, appears to be non-trivial). 
This simplified form results as a consequence of the m.g.f. derivation approach, in 
contrast to the p.d.f.-based approach in |15j . We also mention that the minimum 
eigenvalue p.d.f. (|3.8[) fixes a sign problem with a result given in [23j Eq. 3.12]. 

The new expression (13.7[) facilitates the exact evaluation of the p.d.f. of k%(A) in 
closed-form, for any value of a. This is given in the following key theorem: 

Theorem 3.3. The exact p.d.f. of n 2 D (&) is given by 



k 



(k + iy. 



\fc=0 
n-\-a— 2 n—1 



31=0 



j a =0 \k=l 

X 



A Q (c) 



(3.9) 



r (ran -a -I- J2t=i 3k) 



—H(y-n) , 



where c = {c\(ji), C2O2), • ■ • , c a (j a )} with ci(ji) — I + ji, and H(z) denote the Heav- 
iside unit step function, i.e., H(z) = 1, z > 0, and H(z) =0, z < 0. 
Proof. We use (12.2[) to write the determinant term in (|3.7I) as 

det [L^HU-s) 

_ (n + k)l 



k,i=l,2,. 



l = l( fc+ l) 



■det 



1 



n+k-l-l 



{n + k-l-l)\ 



E 

31=0 



(-n-fc + l + l),, 



(/ + 2) 



(3.10) 



Further manipulation in this form is difficult due to the dependence of the summation 
upper limits on k and I. To circumvent this problem, we use the factorization 

(-n - k + I + _ (-n - k + I + l) n (-71 - a + I + 



(I + % 



(— n — a + / + l)j, 



(/ + 2), 



(n + k-l - 1)! (-71 - a + I + l) jt 



a—k—1 



(n + a-l-l)l 



(1 + 2), 



n 



« ! t=0 

where c;=n + a— 1 — — Z, in (|3.10[) with some algebraic manipulation to obtain 

(71 + a)\(n + a - 1)! 



det 



2,...,a n!(n-l)!nLi(fc + l)' 



ro+a— 2 n—1 / a 

>< e -E n 

31=0 J o =0 \fc=l 



(-71 - a + fc + 



(-s) Jfe det 



-fc-i 



n 



L t=0 



-I fe,i=l,2,...,o: 

(3.11) 
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Now, invoking Lemma [A. II in the Appendix, then substituting (13. lip into (|3.7[) yields 

x A^C- 1 { , e ™ . \ . 

Finally, the result (|3.9[) follows upon carrying out the remaining Laplace inversion 
using [33 Eq. 1.1.2.1]. □ 

For some small values of a, (13.91) admits the following simple forms. 

Corollary 3.4. The exact p.d.f.s of k 2 d {&) corresponding to a — and a = l 
are given, respectively, by 

/S(A)(y) = «(« 2 - - nf- 2 y- n2 H( V - n) , 

fH(A)(y) = r(n ^, + 1)] n(n + - ra) n(n + l)-3 y -n(n + l) 

^ V ; (3)ii! r(n(n + 1) - i - 2) V!/ y 



i=0 



The expression for a = agrees with a previous result given in [21| . 

Remark 1. Whilst previous equivalent expressions have been derived in iSll 
\52\ \4^j , the exact p.d.f. of k 2 d (A) given in \3.9\) is a generalized and/or simpler 
representation. Indeed, noting that the number of nested summations depends only on 
a, this formula provides an efficient way of evaluating the p.d.f. of k 2 d {A), particularly 
for small values of a. Moreover, since the algebraic complexity depends only on n and 
the difference of m and n, this in turn makes our result \3. 9\) very useful for conducting 
an asymptotic analysis of k 2 d (A) as m and n grow large, but their difference does not 
(something which appears infeasible with previous expressions in \31[ \5S[ \4^l )- This 
is the objective of the next section. 

4. Asymptotic Characterization of k 2 d {A). In this section, we employ the 
exact p.d.f. representation (|3.9p to investigate the distribution of n 2 D {A), suitably 
scaled, for fixed a when m, n — > oo. We have the following key result: 

Theorem 4.1. As m and n tend to oo such that a — m — n is fixed, k 2 d (A) scales 
on the order of v? . More specifically, the scaled random variable V = k 2 d (A)/ (/in 3 ), 
with /! 6 M + an arbitrary constant, has the following asymptotic p.d.f. as m and n 
tend to oo with a = m — n fixed: 



jJLV 



where 



— — '■' 1\ J '+i+i 



I= oj=o v 1 ' (H 2 



H(v), (4.1) 

k,l=l,2,...,a 



k\(n + k)\ (2) 



is the modified Bessel function of the first kind of order n. 
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Proof. We arrange the terms in (I3.9[) . noting that m = n + a, to obtain 



\k=0 

n+a— 2 n— 1 / a 



i(n+a) — a — 2 



x E -E mc-i: 

31=0 j Q =0 \fe=l 



3 k 



X 



(— n — a + fc + 1 
A Q (c) 



" f l - - 



-a-2 



-Jk 



r (n(n + a) - a - 1 - J2k=i 3k 



-H(y-n), (4.2) 



Now we have to choose a suitable scaling for the variable n 2 D (A) in terms of n so 
that the above p.d.f. converges as n — > oo. Careful thought reveals that the term 

)n(n+ct) 
converges to a finite non-zero limit as n — > oo if we scale k 2 d (A) 

proportional to n 3 . For this reason, we introduce the scaled random variable V = 
K|)(A)/(/in 3 ), and focus on the function /in 3 /^ 3y (/in 3 u) as n — > oo. To this end, 
with y = /j,n 3 v, by using elementary limiting arguments it can be shown that 



lim nn 3 fH v (pn 3 v) 



f a ^ \ L- oo oo / a ^ ^ 

H (fcTT)! J ^ + i va+ 2 E---E o (^11 (fc + 2 ) Jfc Jfe ! (H^ 



n 



1 (k + 1)! J ^+i v a+2 



det 



E 



fe-i 



fc,i=l,2, 



j A Q (c)fl-(«) 
H(v). 

(4.3) 



We now focus on simplifying the determinant. To this end, we use the binomial 
theorem and the definition of the Stirling number (|2.4|) to arrive at 



det 



E 

31=0 

= det 



(l+jl) 



k-l 



fej=l,2,. 



which can be simplified upon noting the relations 



5 (' + ')' ^U+j^L 



oFi (-,l+j + 2;—) =(l+j + l)\(fj,v)^Ii +j+1 (-?= 
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with qFi (— ;p;z) — X^feLo zk /(p)kk } - denoting the generalized hypergeometric func- 
tion, to obtain 



det 



E 

31=0 



(i + jiT 



k— 1 



1 



(i+2) jl j l \( l ivy> 



f[(k + l)\ det 



fc=l 



fe-1 i 

EE 

»=o j=0 



fc,/=l,2 



jk— 1— i c (j) 



2 

7<i' 



fc.i=l,2, 



Finally, using this result in (|4.3p with some algebraic manipulation, and noting that 
[m 3 fMT3y(p>n 3 v) denotes the p.d.f. of the new variable V, concludes the proof. □ 

Remark 2. Clearly, the above approach of deriving the p.d.f. of the asymptotic 
scaled version of n 2 D (A) explicitly depends on the availability of a closed-form expres- 
sion for the p.d.f. of k 2 d (A). Interestingly, one can directly manipulate the m.g.f. of 
n 2 D (A) instead of the p.d.f. to yield the same asymptotic density. Although we do not 
demonstrate it here, we exploit this technique in deriving the scaled asymptotic p.d.f. 
of n 2 E (A) in section 6. 

The exact asymptotic p.d.f. of the scaled k 2 d (A) / '(fin 3 ) takes the following simple 
forms for small values of a. 

Corollary 4.2. 

For a — 0, 1, 2, the result J^.l[ ) becomes: 



f$\v) = —e-^H(v), 
f^\v) = — 2 e-^{l 



1 



f^'(v)^—e-^L 



1 



2 



H(v) 



The asymptotic p.d.f. corresponding to the case a = and /i = 1/4 is given in |21) 

The advantage of the asymptotic formula given in Theorem l4.1l is that it provides 
an easy to use expression which compares favorably with finite n results. To further 
highlight this fact, in Fig. 14.11 we compare the analytical asymptotic p.d.f. derived 
in Theorem 14.11 with simulated data points corresponding to a = 1, n = 50, fx = 4 and 
a = 2,n = 50, \i = 4. 

Having characterized k|,(A), we now focus on n 2 E (A). 

5. M.g.f. and p.d.f. of k 2 e (A). Here we give new expressions for the m.g.f 
and the p.d.f. of n 2 E (A). We point out that the key derivation steps developed in 
this section may also be useful for characterizing the distributional properties of more 
general condition number metrics, given in [27 l 128 } [29], 

Let A2 = x. By definition, the m.g.f. of «|;(A) for n > 3, is 



M k 2 e(A) {s) = e 




e °> s * s f (Ai, x, . . . , A n ) d\i ydX^dX^ . . . d\ n >dx 



(5.1) 



where IZ2 = {x < A3 < A4 < ... < A n }. Analogous to the result given in (I3.2[) . the 
following theorem provides an exact simple solution for this m.g.f.: 
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Theorem 5.1. The m.g.f. of k 2 e (A), for n > 3, is given by 



M 



«|(A) 



-s(n-l) 



oo e -( n -l)x x mn-l \ /■! 7 2 p -(l-z)(s+x) 



(x + s y 



(i-zY 



(5.2) 



x det 



L^_ 2 {-z{x + s)) L ( n k -%(-( x + 8)) 



i=i,2,...,o+2 >efe. 

J'=l,2 
fc=3,4,...,<*+2 y 
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Proof. We use (I2.1[) in (|5.ip with some manipulation to obtain 











j e~ x x a < 


L 







1[A';< A 1 £ A, xf TJ (A,-A fc ) 2 



i=3 



i=3 



3<fc<j<n 



Ax - Ai) 2 TT(Aj - Ai) 2 A?e- Al ( 1+ ^)dAa 



d\ 3 . . . dX n > dx , 



where we have used A 2 (A) = (x- Ai) 2 n"= 3 ( A j : ~ A i) 2 lYjLsi^ ~ x ) 2 U 3 <k<j<n( X i - 
Afe) 2 , which is valid for n > 3. Applying the variable transformation Ai = xz to the 
innermost integral yields 



£?=3 *i 



K 2 



II a; a II a, ,t 

J=3 j=3 



TT (Aj - A fc ) if ( — , — , . . . , — ,x] d\ 3 . ..d\ n \ dx, 
- L - L \ii i / 

3<k<j<n ) 



where we have defined: 

A3 A4 A„ 



/ ^(i--) 2 n 



z(s+x *>dz 



By symmetry, we convert the ordered region of integration to an unordered region, 
and subsequently introduce the variable transformations yj — ^t, 8 ' (Xj — x) , j = 
3, 4, . . . , n, to obtain 



M K%{A) (s) 



K 



n,a c -s(n-l) 



(n-2)! 



00 ^ — (n-l)a; ^,m7i— 1 



(a; + s) r 



j=3 3<fc<j<n 



z «(i-z) 2 n 

3=3 



2/j 



+ S 



+ 1-2 ) e- z{s+x Uz 



dy 3 ...dy n }dx. 



Now, changing the order of the innermost integrals and then relabeling the variables 
according to yj — Xj-2, j = 3, 4, . . . , n, yields 



M K 2 (A) (s) = 



K 

_f^n 2 a_ —s(n—l) 



(n-2)! 



00 g — (n — l)x gTnn— 1 

(a: + s) mn - 2a " 4 



z Q (l-z)^e 



2 -z(s+x) 



x (-l)" Q i?(n - 2, -(1 - z)(x + s), -(x + s), a)dz 



dx, (5.3) 



where we have defined 



P n 

R(n,a,b,a) := / TT x 2 ,er X] (a - Xjf (b - Xj) a £<? n (x) dx\ . . . dx n 
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The remainder of the proof is focused on evaluating R(n, a, 6, a). Following 
Eqs. 22.4.2, 22.4.11], we start with the related integral 



[0,oo)« j=1 



1 1 7 ' 



a+2 



Y\_(n - Xj)A^(x) dx\dx 2 ■ ■ - dx n 



i=l 



71 



A a + 2 (r)det [Cn+i-iirj)] 



3=1,2 a+2 

2„-x 



(5.4) 



K n , 2 

where Ck(x) are monic polynomials orthogonal with respect to x 2 e~ x , over < x < 

(2 
J k 

a+2 

IJ ( r * _ ^jO^nW dxidx 2 . . . ete r . 



co. As such, Ck{x) = (—l) k k\Ljp(x), which upon substituting into (|5.4p gives 



[O,oo)» i=1 
n! 



\[r 2 , 



j=l 



Q + l 



dct 



A",, 



> 

j'=o 



L (2) fr-1 



i,j'=l,2,...,ct+2 



A Q+2 (r) 



(5.5) 



In the above formula, the r^s are, in general, distinct parameters. However, if we can 
choose Ti such that 

_ J a if i = 1,2 
ri ~ 1 6 if i = 3, 4,..., a + 2, 

then the left side of (|5.5I) becomes precisely R(n, a,b, a). Under direct substitution, 
however, it turns out that the right-hand side of (15. 5[) then gives a ^ indeterminate 
form. Therefore, the task is to evaluate the following limits: 



a+1 



i?(n, a, 6, a) 



#»,2 



x lim 

rx,r2— >a 
r3 .r4,...,r a +2— >b 



.l)(n-l)a JJ(_l)J+l( n + J -)| 

det [41-1^) 



i,j'=l,2,...,a+2 



A Q+2 (r) 

This can be solved by capitalizing on an approach outlined in [41] which gives 



(5.6) 



det 



lim 

r i ,T2— >a 
r 3 ,r 4 ,...,r a+2 — s-6 



ij = l,2,...,a+2 



A a+2 (r) 



det 



(2) 



dai- 



i=l,2,...,a+2 

J=l,2 
ft=3,4,...,a+2 



det 



d 



fe-3 



dai- 1 



db k ~ 3 



(5.7) 



i=l,2 a+2 

j=l,2 
fc=3,4,...,a+2 



The determinant in the denominator of (15.71) can be written as 
det 



,i-l u -A i_1 



daJ- 



-a 



db k ~ 3 



a-l 

4 =i,2,..., Q + 2 -n^(«- fo ) 2a - 

j=l,2 J=0 
fc=3,4,...,a+2 
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The numerator can also be simplified using (I2.3[) to yield 

-1 , . 



det 



* r( 2 ) ( n ) d " rW 



i=l,2,...,a+2 

.7=1,2 
fc=3,4,...,a+2 



Q + l 

(-1)- JJ(-l)^ det 
j'=o 



i=l,2,....a+2 • 

3=1.2 
fc=3,4,...,a+2 



Substituting these expressions into (|5 . T[) and then the result into (|5.6|) gives 



R(n, a, 6, a) = 



(-i)-"° ni nga^+j 



•det 



(fe-i) 



i=l,2,...,a+2 • 

3 = 1,2 
fc=3,4,...,a+2 



(5.8) 



Now we use this result in (|5.3p . apply the relation J Q f(x)dx = f£ f(a — x)dx in the 
inner integral (i.e., with respect to z), along with some basic manipulations which 
concludes the proof. □ 

A straightforward Laplace inversion of the m.g.f. of «^(A) now gives the p.d.f. 
of Kg (A), which is presented in the following corollary: 

Corollary 5.2. The p.d.f. of k 2 e (A), for n > 3, is given by 



x det 



-(fc-i) 



i=l,2,. ...a+2 dz 

3=1,2 
fc=3,4,...,<*+2 ' 



(5.9) 



Although further simplification of (|5.9p seems intractable for general matrix di- 
mensions m and n, we can obtain a closed- form solution in the important case of 
square Gaussian matrices (i.e., m — n). This is given as follows: 

Corollary 5.3. For a = 0, Ii5.9\) becomes 



(A) ( 



12 



»=Q 3=0 



l) i (-n + 2) j 
(3)i(4)i 



0' + 1 -*)(* + J + 2 ) ! 



^ 2 (-1)*+'+* y-" 2 fa - n + l)" 2 +^-3-4 
x E i -• i o mi , £ .- „■ g(y-n+l) 



fc=0 



(i + j + 2 - jfe)! T (n 2 +k-i-j-3) 



y n jy-n) 

T(n 2 -1) 



n -2 



(5.10) 



It turns out that there is also an interesting connection between the density of 
K%(A) and the density of the second smallest eigenvalue, A2. To see this, we start by 
writing the p.d.f. of A2 as 



h 2 (x) = / / f (Xi,x,...,X n ) dX^XadX^-.-dXr, 
Jn 2 -Jo 
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by definition. This can be further simplified by using very similar techniques to those 
used in the above m.g.f. derivation (we omit the specific details) to yield 

/a 2 0) = r Kn ^,, x 2a+3 e~ {n - 1)x [ z a (l- z) 2 e- zx {-l) na R(n - 2, -(1 - z)x, -x, a)dz. 



(n-2) 

Now, applying (|5.8[) . we obtain the following expression for the p.d.f. of A2 



1 



f X2 ( x ) = x 3 e-( n -V x / z 2 (l - z)- a e-^- z > 







x det 



L^l^i-xz) L^} k (-x) i=1 , 2 ,..., a+2 dz. (5.11) 

J 3=1,2 

k=3A,...,a+2 



An equivalent expression can also be obtained by starting with the joint density of Ai 
and A2 given in [23] . 

Remark 3. We remark that the integrands corresponding to variable z in 15. 2\) . 
i f 5. 9\) and 15.11]) are well defined in the vicinity of z — 1 . 

Combining (|5.9|) and (|5.11[) . we obtain the following interesting connection be- 
tween the densities of k|;(A) and A2: 

,(<*) f ^ _ T(mn) r _x f f\ 2 {s) 



f»m(l/) = — ^M^T • ( 5 - 12 ) 

J kj,(A) yrnn I s mn— 11 v ' 

A previous connection between these two densities was also established in |29j . but 
the result obtained was more complicated. Moreover, it appears difficult to establish 
our simplified connection f|5.12[) as a consequence of the representation in [55] ■ Our 
simplified representation here was made possible by adopting an alternative derivation 
approach based on the m.g.f., in contrast to the p.d.f.-based approach used in |29j . 

It is also worth contrasting the expression (|5.1ip with prior equivalent results in 
the literature, particularly those presented in [HUEI]. First, the result (|5.11[) is more 
compact, and in contrast to prior results in |341 151) it does not involve summations 
of functions over combinatorial partitions of integers. Moreover, whilst both (|5.11l) 
and the previous expressions in |341 151] involve determinants, the representations are 
markedly different — in (|5.11l) . the size of the determinant depends on the difference 
between m and n (i.e., a), whilst in [34, 51 it depends on m explicitly. (Determinantal 
expressions having a similar m-dependence have also been derived for various more 
complicated random matrix ensembles; for example, complex non-central Wishart ma- 
trices [26. and generalized- -F matrices |42|.) This, in turn, gives remarkably simplified 
expressions in our case when m and n are of roughly the same order. For example, 
when a — (i.e., m = n), we have the following simple closed- form expression: 

Corollary 5.4. For a = 0, 15.11\) evaluates to 

a. ( x) . l„v - «.--•* gg '""^;^ 2 '^ + 1 - + 1 + 2)! 

y l tl — x*+i+ 2 - k -e- x ). (5.13) 



X 

\ fe=o 



Having analyzed k|;(A) for finite values of m and n, we now focus our attention 
on the asymptotic behavior of the scaled version of Kg (A). 
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6. Asymptotic characterization of k%(A). Here we investigate how k%(A) 
scales as m and n tend to oo with m — n = a being fixed. Unlike in the previous 
asymptotic analysis, in this case it is most convenient to manipulate the m.g.f. of 
K%(A), rather than the p.d.f. We have the following key result: 

Theorem 6.1. As m and n tend to oo such that a = m — n is fixed, k 2 e (A) scales 
on the order of n 3 . More specifically, the scaled random variable V — k|(A) / (nn 3 ) , 
with /.t G M + being an arbitrary constant, has the following asymptotic p.d.f. as m 
and n tend to oo with a — m — n fixed: 



fv( v ) = ~TT / n " \ a det[gi tj (z,v) flfi,fe-2(l,v)]i=i,2,...,a+2 dz, (6.1) 
A* « Jo U - z ) J=l,2 



1 z 2 



3- 

fc=3,4,...,a+2 



where, for i,j = l,2,...,a + 2, 



« — 1 



Proof. Our strategy is to derive the m.g.f. of using the m.g.f. of fc^(A) given 
in (|5.2p . Subsequent application of the limits on m and n followed by the Laplace 
inversion will then yield the desired asymptotic p.d.f. As such, we can write 



3.2) 



We first employ (|2.2p . and use similar arguments as in the derivation of p. lip , to 
write the determinant in (15.21) as 



det 



L 



(i+i) 



„ + i-j-2(- z ( x + s )) L n+i-k(-( x + s )) 
2!3!(n + a - l)!(n + a - 2)! n"= 3 2 (n + a + 2 - j)! 

n+a— 1 n+a— 2 n+a — 1 n+a — 2 n + a — 3 n / , i \ / , r*\ 

x V" V" (-n-a + l)/ 1 (-Ti-a + 2) i 



2=1,2,. ...Q + 2 

.7=1,2 
fe=3,4,...,a+2 



h=0 i 2 =0 h=0 U=0 h=0 
J 



X z 



x det 



+ 1 — 2 a+1— i 

n & - o n - o 



(3) il (4) i2 Z 1 !Z 2 ! 

x + s)) ifc + 2 



L (=0 



;=o 



i=l,2,....a+2 
fc=3,4,...,a+2 



(6.3) 



where = n + a — j — lj and Wk = n + a + 2 — k — Ik- Now, invoking Lemma lA.21 to 
compute the remaining determinant, we use the resulting expression along with 
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hi (d.Jj) to obtain the m.g.f. of as 



M , 



(n + a) 2 (n + a + l)(n + a - 1) _ * r n _i) 
3 (A)(s) = =-7n e ^ v 



2!3!n"i 2 a-l)! 



oo — (n— l)a; n(n+a) — 1 



fi if 



n(n+a) — 4 



(l-z)- Q z 2 e - (1 - z) (7^ +3; ) 



n+a — 1 n+a — 2 n+a— 1 n+a — 2 n+a— 3 



>< E E E E E - E 

; 1= o l 2 =0 l 3 =0 u=o l 5 =0 l a +2=0 

lk + 2 



(-n-a + l) h (-n-a + 2)i 2 h+h 



X 



-q (-n - a + 



fc + 2 



(fc + 2) ;fc+2 ? fc+2 ! 



/in 



a; H t - x H ? A a+2 (w) ciz > 



/in" 



Then, we use the variable transformation t — xn, change the order of integration, and 
apply elementary limiting arguments as n —¥ oo to obtain 



lim M K 2 fA1 (s) = -ro 

— ^ W 2!3!n; = + 3 2 (j-l)! 



„1 OO OO OO 

/ d-^EE- E 

J ; n 7 n 7 



2 1= OZ 2 =0 i Q+ 2=0 



z' 1+ ' 2 A a+2 (w) 
(3) (l (4) i2 / 1 !Z 2 ! 



n 



i 



3 f ^t 3 +^?=i 2 ^ eft \ dz. 



Using the variable transformation a; = we compute the inner integral, and subse- 
quently perform a Laplace inversion to yield the limiting p.d.f. of V = : 



OO OO 



(i-^EE- E 



i+^A Q+2 (w) 



n 



i 



v/i ^ (k + 2) lk+2 l k+2 l {iiv) l *+! 



l 1= 0l 2 =0 l a +2=0 

I dz 



(3) h (A) h h\l 2 \ (jivy> 



— i- f i 

e f° 



2!3!n"= 3 (i-l)! ^ v Jo 



x det 



E 



(l-z)- a z 



7 E 



OO I — I 



i=l,2,....a+2 

3=1,2 
fc=3,4,...,a+2 



Recalling the relations 

z 3=i + l n .7 = 1,2, 

tu,- = j + Ij - 2, i = 3, 4, . . . , a + 2 , 
with some manipulation then gives 



1 



2!3!n?= 3 (i-l)! 



(1-z) a z 2 det j (z,v) </ ijfc _ 2 (l, u)]t=i,2,...,a+2 dz, 

3=1,2 
fe=3,4 a+2 

(6.4) 
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where 

Noting the fact that l p - = J2q=o ^phQj~^) • • ' an d making the identification 

oo u 

£ {a)k{ l_ N)l = (« - 1)! x**^ W_ 1 (2^) 

we arrive at 

Finally, noting the fact that <ji_j(z,v) = (j + 1)! gij(z,v), i,j = 1,2, . . . , a + 2, and 
using (|6.5[) in (16.41) with some algebraic manipulation concludes the proof. □ 

For a = 0, we have the following remarkably simple closed-form solution: 

Corollary 6.2. For a — 0, Ii6.1\) becomes 

fv(v) = iL L nT fie 2^3 U 7 --AA,6;—)- — iF 2 fj;5,7;— ) 
57b fi^v a \ V 2 fxv y f/,v \2 pm J 

+ — 3*4 4, 4; 3, 5, 5, 7;—)) , 

where p F q (ai, ci2, . . . , a p ; b±, &2 • • • , b q ; z) is the generalized hypergeometric function. 
This is easily obtained by setting a = and integrating using [SHI Eq. 2.15.19.1] with 
some basic algebraic manipulations. 

In Fig. 16.11 we illustrate the applicability of the analytical asymptotic results in 
the finite context. Specifically, we compare the analytical asymptotic p.d.f. given in 
Theorem 16.11 with that of simulated points corresponding toa = 0,n= 10, /i = 4 and 
a = 1, n = 50, fi = 4. The close agreement is clearly apparent. 



Appendix A. Some Useful Determinant Results. 

The following results are useful for the proofs of Theorems 13.31 and 16.11 For nota- 
tional convenience we denote Ajv(x) = ni<;<fe<iv ~ x where x = {xi,X2, ■ ■ ■ , a; at} 
withAi(x) = l. 

Lemma A. 1. Let n,a e Z+ with n,a > 1. Then 



det 



a-k-l 



n g-o 



= A«(c) 



fe,/=l,2,. 



where ci=n + a- l-ji~ I, c = {ci(ji), c 2 (j 2 ), • ■ • , c Q (i Q )} and q(j';) = Z + j; with 
ji = 0, 1, . . . , n + a — 1 — I. 
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25 r 



Simulation 

Analytical Asymptotic 




(a) a = and n = 10 



0.02 0.04 0.06 

v 



0.08 

(b) a = 1 and n = 50 




0.1 0.12 



Fig. 6.1. Comparison of simulated data points and the analytical p.d.f. fy (v) for two different 
n, a settings with fi = 4. 



Proof. Let us rewrite the determinant as 



det 



i-fc-i 

n 



i=0 



k,l=l,2,...,a 



Rfr-i) life 

ct — 3 a— 3 

n^-o rife 



i=0 



i=0 



Cl 
1 



C2 
1 



a-2 



n ^ a ~ 

i=0 

s-3 

n fe ~ ^ 



i=0 
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Now the repeated application of the following row operations 



fcth row — > fcth row + S^f £ % ' x (k + i)th row , i = 1,2, . . . , a — 1 — k 
on each row, from k — 1 to k = a — 2, of the above determinant gives 



det 



k,l=l,2,...,a 



n 

. i=0 



j=0 i=0 
a-2 3-1 



3=0 
ct-2 



i=0 



E^IK 5 *-*) E^II^ 

3=0 i=0 3=0 



Cl 

1 



i=0 



C2 
1 



Es^nV-o 

3=0 i=0 
a-2 J-l 

E^n^-*) 

j=0 j=0 



1 



We may use Lemma 12.41 to simplify each entry of the above determinant to obtain 



det 



oi—k—1 



n c* 



i=0 



fc./=l,2, 



~CK— 1 ~CK — 1 



~a-2 ~a-2 
C n Co 



Cl c 2 

1 1 



= (-l) LfJ det[cf- 1 ] M=li 
where ['J is the floor function. Noting the relation 

c k = n + a - 1 - c k (jk) , k = 1,2, ...,ce , 

we obtain 



det 



a-fc-l 



n &-o 



L i=0 



J fc,Z=l,2, 



= (-l)LfJdet [r 1 ]^....^ 
= (-l)L§J [] (c fe -q) 

l<Z<fc<Q 

= (-i)l4j (cfcO'O-cjOj)) 



i<;<fe<ct 



A Q (c) 



where we have used the fact that |_^rj is an even number for a = 1, 2, . . .. □ 
Lemma A. 2. Let n e Z + and oeZ+U {0} wit/i n > 2. TTiera 



det 



a+1 — i 



a+1— £ 



n (% -i) n - o 



i=0 



i=0 



i=l,2,...,a+2 

3 = 1,2 
fe=3,4,...,a+2 



= A Q+2 (w) 
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where w = [z\, z%, W3, . . . , w a+ 2) and 



Zj =j + lj, 

w k = k + l h - 2, 



with 



Z; = n + a — z ; 



J = 1,2 



u>k = n + a — Wk, k = 3, 4, . . . , a + 2 , 



Zj- =0, l,...,n + a-j, i = l,2, 

Z/s = 0, 1, . . . , n + a + 2 - fc, A; = 3, 4, . . . , a + 2 



Proof. 



det 



"q + 1 — i 

n 



a+l— i 

n &-o n (^- ? ) 
1=0 



i=0 



i=0 
a-1 



i=0 



^1 
1 



i=l,2,....a+2 

3=1,2 
fc=3,4,...,a+2 



a-1 



i=0 



i=0 



i=0 

Q-l 

i=0 



^2 
1 



•«'3 
1 



W a +2 
1 



Now, repeatedly applying the following row operations 

fcth row — > fcth row + s| J a ) ] l 2 2 J fe ' £ ~ l ' ) (fc + i)th row, i = 1,2, . . . , a + 1 — fc 
on each row (i.e., fc = 1, 2, . . . , a) and using Lemma 12.41 gives 

a+l — i 



det 



a+l— £ 



II&-0 11(^-0 



/=0 



;=o 



i=l,2,... .a+2 

3 = 1,2 
fc=3,4,...,a+2 



~a+l ~a+l ,r,a+l 



~ a+l 



~a ~a 



Zl Z2 
1 1 



u 3 



«>3 
1 



a+2 
^+2 



Wa+2 
1 



a+2 ■ 

3=1,2 
fc=3,4,...,a+2 

Interestingly, the resultant simplified determinant is of Vandermonde type. This in 
turn gives 

(-1) L ^ J det [zj -1 v%~ 1 ] i =i,2,...,a+2 =det[z^ 1 w^ 1 ] l =i, 2 ,...,a+ 2 =A Q+2 (w). 

3=1,2 3=1,2 
fc=3,4,...,a+2 fe=3,4,...,a+2 
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